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MEMORANDUM FOR FILE 


' languaee based on the earlier language B [11. Tr.e languages and their compilers 

c J LL. c iniroduc-s ihs notion of types, and defines apptopttate extra syntax and 

;™nncrrarc or.he noM ns a K„e ootnpder. producing tnaehtn. code wfiere B Prtxi.ced .ntc 

p re live code. 

Most of the software for the UNix time-sharing system [2l is written in C ^ ts the operaimg s>s- 
• ir r- jo tIcq available on the HIS 6070 computer at Murray Hill and and on the iB.t S.s 
a- 5o JeTl TOs paper is a manual only for the C language ioelf as itnpletnen.ed on 
The m. iT. however! hints are given occasionally in ihe text of implementarionajependettt features. 

The IMX Programmer's Manual [4l describes the library routines J 

rJ ^ -n Thr* Trored'’rcs for compiling programs under that system. The GCOS C Library by 

1 ps^and Barres [M describes routines available under that system as well as compilation pr^edures. 

tie and the author's previous introductions to allegedly impenetrable subjects. 

2 Lexical conventions , 

Ther’ a'e s x ktnds of tokens: identifiers, keywords, constants, strings, express.onyoperato^ and 

other 'pa-or" In general blanks, tabs, new'ines, and comments as described below are 'gnofcd ex- 
??pri^'fher serve to separate tokens. At least one of these characters is required to separate oi..er- 
wise adjacent identifiers, constants, and certain operator-pairs. 

If the input stream has been parsed into tokens up to a given character, the next token is taken to 
include the longest string of characters which could possibly constitute a token. 

2.1 Commenis 

The characters /* introduce a comment, which terminates with the characters •/. 

2.2 Identifiers (Names) ,u - • Tv,».„.--r 

vn identifier is a seouence of letters and digits; the first character must be alphaoetic^_ The unu r- 

score counts as alpha'oetic. Upper and iower case letters are considered different. i o mor 
the first eight characters are significan-. and only the first seven for external identifiers. 

2.3 Kevwords , . ^ ^ 

The following iden.ifiers are reserved for use as keywords, and may not be used otherwise. 
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int 

c'^ar 

float 

double 

struct 

auto 

extern 

register 

static 

goto 

return 

sizeof 


break 

continue 

if 

else 

for 

do 

while 

switch 

case 

default 

entry 


The entry keyword is not currently implemented by any compiler but is reserved for future use. 

2.3 Constinis 

There are several kinds of constants, as foilows: 

2.3.1 Integer constants 

An integer constant is a sequence of digits. An integer is uken to be octal if it begins with 0, de¬ 
cimal otherAise. The digits 8 and 9 have octal value 10 and 11 respectively. 

2.3.2 Character constants 

A charaCiCr constant is 1 or 2 characters enclose^ in single quotes “ Within a character con^^tan^ 
a single quote must be preceded by a back-slash Certain non-graphic characters, and “V’ itself 
may be escaped according to the following table: 


BS 

\b 

NL 

\n 

CR 

\r 

HI 

'.t 

ddd 

\ddd 

\ 

W 


The escape \ddcf consists of the backslash followed by 1, 2, or 3 octal digits which are taken to speci¬ 
fy the value of the desired character A special case of this construction is (not followed bv a ui- 
gil) which indicates a null character. 

Character constants behave exactly like integers (not, in particular, like objects of character type) In 
conformity v uh the addressing structure of the PDP-11, a character constant of length I has the cod- 
for the given character in the low-order byte and 0 in the high-order byte; a character constant of 
length 2 has the code for the first character in the low byte and that for the second character in the 
high-order byte. Character constants with more than one character are inherently machine-dependent 
and should be avoided. 


2.3.3 Floating constan‘.s 

A fioating constant consists of an integer part, a decimal point. i fraction part, an 8. and ar optional- 
ly signed integer exponent. The integer and fraction parts both consist of a sequence of digits. Either 
the integer pan or the fraction part (not both) may be missing; either the decimal point or l.he e and 
the exponent (not both) may be missing. Every fioating constant is taken to be double-precision 


2.4 Strings 

r a_sequence of characters surrou.nded by double quotes - A string has the tvpc arrav- 

Or-LharuLier:b :> 2 e below) and re;er$ to an area of storage initialized with the given characters The 

f nd^ ^ "I’ programs which scan tne string can 

cl K H character must be preceded by a “V'; in addition, the same escapes 

as described :or character constants may be used. ^ 
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3. Syntax notation 

In the syntax notation used in this manual, symac’ic categories are indicated by talic type, and 
liierai words and characters in gothic. Alternatives are listed on separate lines. An oydonaJ lerminal 
or non-terminal symbol is indicated by the subscript “opC’ so that 

{ expression^ } 

would indicate an optional expression in braces. 

4. What’S in a Name? 

C bases the interpretation of an identifier upon two attributes of the identiner: its sto^z^e class and its 
i\pe. The storage class determines the location and lifetime of the storage associated wim an identifier; 
:ne type determines the meaning of the values found in :he identifier’s storage. 

There are four declarable storage classes: automatic, r^iic, external, and register. Automatic vari- 
ibles are local to each invocation of a function, and are ciscarded on return; static variaries are local to 
-'unciion, but retain their values independently of invreasons cf the funciion; exte—il variables are 
■.“dependent of any function. Register variables are stored in the fast registe.“S of the michine, like au¬ 
tomatic variables they are local to each function and disappear on return. 

C supports four fundamental types of objects: characters, integers, single-, and cdubie-precision 
Routing-point numbers. 

Characters (declared, and hereinafter called, cha^; are chosen from the ascii -se:; they occupy 
the right-rrosi seven bits of an 8-bit byte. It is also possible tc inrarprei chars as signed: 2's - 
complement 8-bii number:. 

Integers (int) are represented in 16-bit 2’s complemenl notation. 

Singje precision floating point (float) quantities have magnitude in the range approximately 
or 0; their precision is 24 bits cr about seven decimal digits. 

Double-precision fioaling-p-oint (double) quantises have the same icnge as f:o£3 and a preci¬ 
sion of 56 bits or about 17 decimal digits. 

Besides the four fundamental types there is a conceprually infinite class of derived types constructed 
from the fundamental types in the following ways: 

arays of objects of most types; 

functions which return objects of a given type; 

. pointers to objects of a given type; 

5 /nvmroj coriaining objects of varioLL *ypes. 

In general these methods of constructing objects can be applied recursively. 

5- Objects and lvalues 

An object is a manipulaiable region of storage; an lvalue is an expression referring to an object. .An 
envious example of an lvalue expression is an identifier. There are operators which yield lvalues: for 
example, if E is an expression of pointer type, then *E is an lvalue expression referring :o the object to 
which E points. The name “lvalue" comes from the assignment expression **E1 = E2“ in which the 
left operar»d El must be an lvalue expression. The discussion of each operator below indicates wheth¬ 
er it expects lvalue operands and whether it yields an lvalue. 

6. Conversions 

.A number of operaurs may, depending on their operands, cause conve'sion of ire vaiue of an 
crerand from one type to another. This section explains the result to be expected from suen conver¬ 
sions. 
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6.1 Characters and integers 

A char coject may be used anywhere an int may be. In all casei the char is converted to an int by 
propagating its sign through the upper 8 bits of the resultant integer. This is consistent with the two's 
complement representation used for both cnaracters and Integers. (However, the sign^propacation 
feature disappears in other implementations.) 

6.2 Float and double 

All floating arithmetic in C is carried out in double-precision: whenever a float appears in an expres¬ 
sion it is lengthened to double by zero-padding its fraction. When a double must be convened to 
float, for example by an assignment, the double is rounded before truncation to float :ength. 

6.3 Float and double; integer and character 

All ints and chars may be converted without loss of significance to float or double. Conversion of 
boat or double to mt or char takes place with truncation towards 0. Erroneous results can be ex'^cted 
if the magnitude of the result exceeds 32,767 (for Int) or 127 (:':r char). 

6.4 Pointers and interers 

Integers and pointers may be acced and compared; in such a case the Int ;s conver.e: as specif.ed in 
the discussion of the addition operator. 

Two pointers to objects of the same type may be subtracted: in this case the resui: is convened to 
an integer as specified in the discussion of the subtraction operator. 


7. ".tNpression? 

1 he precedence of expression operators is the same as the order of the major subsections of this sec¬ 
tion (highest precedence first). Tnus the expressio.ns referred to as the operands of -h (§7.4) are those 
expressions defined in §§7.1-7.3.^ Within each subsection, the operators have the same precedence. 
Left- or right-associatryiiy is specified in each subsection for the operators discussed therein. The pre¬ 
cedence and associativity of ail the expression operators is summarized in an appendix. 

Otherwise the order of evaluation of expressions is undefined. In particular l^he cempiler considers 
Itself free to compute s^jexpressions in the order it believes miCsi efficient, even if ih* subexpressions 
involve side effects. 

7.1 Primary expressions 

Primary expressions involving ., ->, subscripting, and function calls group left to 

7.1.1 identifier 

An identifier is a primary expression, provided it has been suitably declared as disc:ssed below Its 
type IS specified by its declaration. However, if the type of the identifier is “array ct “ then the 
value of the identifier-expression is a pointer to the first object in the array, and the tvpe of the expres¬ 
sion IS “pointer to Moreover, an array identifier is not an (value expression. 

Likewise, an identifier which is declared “function returning ...*\ when used except “n the function- 
name position of a call, is converted to “pointer to function returning 

7.1.2 constant 

A decimal octal, character, or foaling constant Is a primary expression, its type is int in the first 
three cases, double in the last. 


7.1.3 strinj^ 

A smng is a primary expression. Its type is originally “array of char”; but followini the ^ame rule 
cLTaclenl, [hi'string '' ' " "" '» 
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7.1.4 ( expression ) 

A parenthesized expression is a primary ^•xpression whose type and value are identical to these of 
the unadorned expression. The presence cf parentheses does not affect whether the expression is an 
lvalue. 

7.1.5 primary-expression [ expression ] 

A primary expression followed by an expression in square brackets is a primary expression. The in¬ 
tuitive meaning is that of a subscript. Usually, the primary expression has type ‘‘pointer to the 
subscript expression is int, and the type of the result is The expression “EUEIl” is idenucaJ 

(by definition) to “• ( ( El) + (E2) ) ”. All the clues needed to understand this notation are contained 
in this section together with the discussions in §§ 7.1.1, 7.2.1, and 7.4.1 on identifiers, •, and + respec¬ 
tively; §14.3 below summarizes the implications. 

7.1.6 primary-expression { expression-list^^) 

A function call is a primary expression followed by parentheses containing a possibly er.pty, 
comma-separated list of expressions which constitute tPe actual arguments to the function. The rri- 
mary expression must be of type “function returning and the result of the funciior call is cf v-pe 
As indicated below, a hitherto unseen identifier followed immediately by a left parenthes s is 
contextually declared to represent a function returning an integer; thus in the most common case. 
Integer-valued functions need not be declared. 

Any actual arguments of type float are converted to double before the call; any of type ch£' are 
convened to int. 

In preparing for the call to a function, a copy is made of each actual parameter; thus, all argurr.ent- 
passing in C is strictly by value. A function may change the values of its formal parameters, but these 
changes cannot possibly affect the values of the actual parameters. On the other hand, it is pereciiy 
possible to pass a pointer on the understanding that the function may change the vMue of the ob eci to 
which the pointer points. 

Recursive calls to any function are permissible, 

7.1.7 primary-lvalue . member-of-structure 

An lvalue expression followed by a dot followed by the name of a member of a structure is a pri¬ 
mary expression. The object referred to by the lvalue is assumed to have the same form as the struc¬ 
ture containing the structure member. The result of the expression is an lvalue appropriately effset 
from the origin of the given lvalue whose type is that of the named structure member. The given 
lvalue is not required to have any particular type. 

Structures are discussed in §8.5. 

7.1.8 primary-expression — > member-of-structure 

The primary-expression is a.ssumed to be a pointer which points to an object of the same form as the 
structure of v -'ch the member-of-siructure is a part. The result is an lvalue appropriately offset from 
the origin of the pointed-lo structure whose type is that of the named structure member. The iyj>e of 
the primary-expression need not in fact be pointer; it is sufficient that it be a pointer, character, or in¬ 
teger. 

Except for the relaxation of the requirement that El be of pointer type, the expression 
“El —>MOS” is exactly equivalent to “(•£!).MOS'''. 

7.2 Unary operators 

Expressions with unary operators group right-to-Iefl. 

7.2.1 • expression 

The unary • operator means indirection: the expression must be a pointer, and the result is an iv-iue 
referring to the object to which the expression points. If the type of the expression is '‘pointer to 
the type of the result is 
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7.2.2 & Ivalue-expression 

The result of the unary & operator is a poir^ter to the object referred to by the Ivalue-exuression If 
the type of the lvalue-expression is \ the type of the result is “pointer to 

7.2.3 — expression 

The result is the negative of the expression, and has the same type. The type of the -xn.ession 
must be char, mt, float, or double. ^ expiession 

7.2.4 ! expression 

,h J"!® operator ! is 1 if the value of the expression is 0 0 T the value of 

.hy xpression „ „or-ip,o. Thp .,p. of ,he resul, is ini This operhio? is appfabiP onlrto llSs t 

7.2.5 ^expression 

0, cl;h,:a°^S'e"=sS?!s'!’ni°"''' “f The type of ,he expression mus. be int 

7.2.6 -h-h lvalue-expression 

The object referred to by the lvalue expression is incremented. The value is the new value of the 

menteH '^e expression VU or c^h is ^re- 

ented bv 1, if it is a pointer to an object, it is incremented by the length of the object + 4 - k annii 
caole only to these types. (Not, for example, to float or double.) ^ ^ppli- 

7.2.7 — lvalue-expression - ^ 

The object referred to by the lvalue expression is decremented analogously to the ++ operator. 

7.2.8 halue-cxpression ++ 

7.2.9 halue-expressicr: — 

7.2.10 sizeof expression 

.ho^Lfn" mb^'f^bVSt IS: ?,'y Vho'y U to,™Tnld » 

.ho exprossio,,. This Expression is 

Siam ,s required, l.s mqlcr use is in oommunioa.ion wi.h routines like lio^e .lloca.ors’eJd'l/O s™.' 

7.3 Multiplicative operators 

The multiplicative operators •. /, and % group lefi-to-right. 

7.3.1 expression • expression 

onlls' ,nfo?=;p7."^''-rntro, •h'"' 

If bo,h are lloal o, double, ihe resell is double. No other c"r^J“,Ls .rfa.to.e^^^ 
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7.3.2 expression / expression 

The binan- / operator iiroicate, diviaio,,. The aame type cohsideratiora as for multiplication apply. 

7.3.3 expression \ expression 

ihe binary % operator yields the remainder from the division nr a 
B oth operands must be int or char, and the result is int In the expression by the second, 

has the same sign as the dividend. " implementation, the remainder 

7,4 Additive operators 

The additive operators -f and — group lef*i-io-righl. 

7.4.1 expression 4 expression 

converted to double and the result is double If an int or double, the fenmer is 

verted by multipiying it by the lengtf ? the ^fe^T 

pointer of the same tvpe as the origirai oointer f If P ^ result is a 

-P+r- is a pointer to another objS cf tSe sw^^pVas 

storage. immediately following it in 

No Other type combinations are allowed. 

7 4.2 expression — expression 

converted m tht same way as explained under + above ^ Pomter, the former is 

z 'r" “ 

This conversion wiil in general give unexpected results unle« separating the pointed-to objects, 
array, since pointers, even to objects onhe samTtv^e So^^^^^^^ 

object-length. °° necessarily differ by a multiple of the 

7.5 Shift operators 

The shift operatois < < and >> group left-to-righL 

7.5.1 expression < < expression 

7.5.2 expression > > expression 

Both operands must be int or char and the rp^Mir ic ir>* tu 
negative. The value of “E1<<E2’' is El (internretpH ac w* operand should be non¬ 
bits; vacated bits are 0-filled. The value of ^‘E1>>E2” is^ Fl 16 b,is long) left-shifted E2 

I6.bii quantity) arithmetically ri-ht-shifted E2 bit ^ complement, 

sign bit of El. [Note: the use of aritEic ratherThan ^ ‘he 

between machines.) iogical shift does not survive transponaiion 

7.6 Relational oper.aiors 

‘'f'-'-"*'--- '”'' "■« f«, is .„y „„rui: ■•c<b<c-<,o=c uo. cc,„ 

7.6.1 expression < expression 

7.6.2 expression > expression 
^-6.3 expression < = expression 
/.6.4 expression > = expression 

ccS^oT.ifySo%'!^fh;^" dL'S " >- -h- o, 

the same as for the - operate except^ S S)in ers of anJ'■’ conversion is exactly 

case depends on the relative locaS In i'craTofThfno^^eHT T'' 'h® ^«ult in this 

meaningful to compare pointers with integer other than 0. ° 
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7.7 Equality operators 

7.7.1 expression — == expression 

7.7.2 expression ! * expression 

The =*=* (equal to) and the !=* (not equal to) operators are exactly analogous to the relational opera¬ 
tors except for their lower precedence. (Thus “a<b —=c<d” is I whenever a<b and c<d have the 
same truth-value). 

7.8 expression & expression 

The & operator groups !efi-to-right. Both operands must be int or char; the result is an int which is 
the bit-wis^ logical and function of the operands. 

7.9 expression * expression 

The " operator groups left-to-right. The operands must be int or char; the result is an int w'hich is 
the bit-wise exclusive or function of its operands. 

7.10 expression i expression 

The I operator groups len-to-right. The operands must be int or char; the result is an int which is 
the bit-wise inclusive or of i'.s operands. 

7.11 expression && expression 

The && operator returns 1 if both its operands are non-zero, 0 otherwise. Unlike i, && guarantees 
iefi-io-right evaluation; moreover the second operand is not evaluated if the first operand is 0. 

The operands need not have the same type, but each must have one of the fundamental types or be . 
a pointer. — 

7.12 expression II expression 

The II operator returns 1 if either of its operands is non-zero, and 0 otherwise. Unlike I, II guaran¬ 
ty :s lefi-to-right evaluation; moreover, the second operand is not evaluated if the value of the first 
operand is non-zero. 

The operands need not have the same type, but eacn must have one of the fundamental types or be 
a pointer. 

7.13 expression ? expression : expression 

Conditional expressions group left-to-righi. The first expression is evaluated and if it is non-zero, 
the result is the value of the second expression, otherwise that of third expression. If the types of the 
second and third operand are the same, the result has their common type; otherwise the same conver¬ 
sion rules as for -r apply. Only one of the second and third expressions is evaluated. 

7.14 Assignment operators 

There are a number of assignment operators, all of which group right-to-left. All require an lvalue as 
their left operand, and the type of an assignment expression is that of its left operand. The value is 
the value siored in the left operand after the assignment has taken place. 

7.14.1 lvalue = expression 

The value of the expression replaces that of the object referred to by the lvalue. The operands need 
not have the same type, but both must be*int, char, float double, or pointer. If neither operand is a 
pomier, the assignment takes place as expected, possibly preceded by conversion of the expression on 
the right. 

When both operands are int or pointers of any kind, no conversion ever takes place; the value of the 
expression is simply stored into the object referred to by the lvalue. Thus it is possible to generate 
pointers which will cause addressing exceptions when used. 
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8.2 Type specifiers 
The type-specifiers are 

type-specifier: 

Int 

char 

float 

double 

struct { [ype-decNist ) 
struct identifier { type-decNisr } 
struct identifier 

The struct specifier is discussed in §8,5. If the type-specifier is missing from a declaration, it is gen¬ 
erally taken to be int. 

8.3 Declarators 

The declarator-list appearing ii; a declaration is a comma-separated sequence of declarators. 

. declarator-list: 

declarator 

declarator , declarator-list 

The specifiers tn the declaration indicate the type and storage class of the objects to which the declara- 
lors refer. Declarators have the syntax: 

declarator: 

identifier 
• declarator 
declarator () 

declarator [ constant-expression^ ] 

( declarator ) 

The grouping in this definition is the same as in expressions. 

8.4 Meaning of declarators 

Each declarator is taken to be an assertion that when a construction el the same form as the declara¬ 
tor appears in an expression, it yields an object of the indicated type and storage class. Each declarator 
contains exactly one identifier, it is this identifier that is declared. 

If an unadorned identifier appears as a declarator, then it has the type indicated by the specifier 
heading the declaration. 

If a declarator has thd form 

• D 

for D a declarator, then the conuined identifier has the type “pointer to ...*\ where is the type 
which the identifier would have had if the declarator had been simply D. 

If a declarator has the form 
DO 

then the contained identifier has the type “function reluming where is the type which the 
identifier would have had if the declarator had been simply D. 

declarator may have the form 

Dfconstant-expression) 

or 

D[1 

In the first case the constant expression is an expression whose value is determinable at compile lime. 
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and wlidse type is int. in the second the constant 1 is used: (Constant expressions r:t ceined precise¬ 
ly in §15.) Such a declarator makes the contained identifier have type “array.” If the unadorned de¬ 
clarator D would specify, a non-array of type tnen the declarator yields a 1-dimensicnal 

array with rank i of objects, of type If the unadorned declarator D wouL specify an 

/ 7 -dimensional array with rank ^'.x/^x ...xthen the declaictor yields an 

(/j-hi) -dimensional array with rank x x ... x x 

An array may be constructed from one of the basic types, from a pointer, from a structure, or from 
another array (to generate a muki-dimensional array). 

Finally, parentheses in declarators do not alter the type of the contained identifier except insofar 2 S 
they alter the binding of the components of the declarator. 

Not all the possibilities allowed by the syntax above are actually permitted. The restrictions are’s 
follows: functions may not return arrays, structures or functions, although they r:*ay return pointers :o 
such things; there are no arrays of functions, although there may be arrays of pointers to functichs. 
Likewise a structure may not contain a function, but it may contain a pointer to a funetioo. 

As an example, the declaration 
int i. *ip, H ), *iip( )t {*Pn)( ): 

declares an integer /, a pointer ip to an integer, a function /returning an integer, a functicr: fip relum¬ 
ing a pointer to an integer, and a pointer pfi to a function which returns an integer. Also 

float fa[17]. *afpl17]: 

declares an array of float nurtibers and an.array of pointers to float numbers. Finally, 
static int x3di3l[5l[7l: 

declares a static three-dimensional array of integers, with rank 3x5x7. In compete detail, x3d \z an ar¬ 
ray of three items: each item is an array of five arrays; each of the letter arrays is an array of seven in¬ 
tegers. Any of the expressions “x3d'\ “xSdlil”, “x3d[i](jl”. “jJdlilljlfkr may reasonably ap:»ear 
in en expression. The first three have type “array”, the last has type int 

8.5 Structure declarations 

Recall that one of the forms for a structure specifier is 
struct { type-decNist ) 

The type-decNist is a sequence of type declarations for the members of the structure: 

type-decl’list: 

type-declaration 
type-declaraiion type-decNist 

A type declaration is just a declaration which does not mention a storage class (the r^rage class 
“member of structure” here being understood by context). 

(ype-declara tion: 

type-specifier declcratcr-list ,; 

Within the structure, the objects declared have addresses which inciease as their declaraticns are re^i 
left-to-right. Each component of a structure begins on an addressing boundary appropriate :o \is tyre. 
On the PDP-11 the only requirement is that non-character^ begin on a word boundary; therefore, there 
may be 1-byte, unnamed holes in a structure, and all structures have an even length in bytes. 

Another form of structure specifier is 

struct identifier { type-decNist ] 

This form is the same as the one just discussed, except that the ideniiner is remembered is the str^- 
ture tag of the structure specified by the list. A subsequent declaration may then be given using the 
structure lag but without the list, as in the third form of structure specifier: 
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Stnjct identifier 

Structure tags allow definition of self-referential structures; they also permit the long part of the de¬ 
claration to be given once and used several times. It is however absurd to declare a structure which 
contains an instance of itself, as distinct from a pointer to an instance of itself 

A simple example of a structure declaration, taken from §16.2 where its use is illustrated more fully, 
is 

Struct tnode { 

char tword[?Ol; 
int count; 
struct tnode *left; 
struct tnode *right; 

1 1 

which contains an array of 20 characters, an integer, and two pointers to similar structures. Once this 
declaration has been given, the following declaration makes sense: 

struct tnode s. ^sp; 

which dechres s to be a structure of the given sort and sp to be a pointer to a structure of the given 
sort. 

The name: of structure members and structure tags may be the same as ordinary variables, since a 
distinction can be made by context. However, names of tags and members must be distinct. The 
same member name can appear in different structures only if the two members are of the same type 
and if their origin with respect to their structure is the same; thus separate structures can share a com¬ 
mon initial segment 

9. Statements 

Except as indicated, statements are executed in sequence. 

9.1 Expression statement 

Most statements are expression statements, which have the form 
expression ; 

Usually expression statements are assignments or function calls. 

9 2 Compound statement 

So that several statements can be used where one is expected, the compound statement is provided: 

compound-statement: 

{ statement-list } 

statement-list: 

statement 

statement statement-list 

9.3 Conditional statement 
The two forms of the conditional statement are 

if ( exp.'^essic ,’:) statement 

if ( expression ) statement else statement 

In bom cases the expression is evaluated and if it is non-zero, the first substatement is executed. In 
the second case the second substatement is executed if the expression is 0. As usual the “else” ambi¬ 
guity is resolved by connecting an else with the last encountered elseless if. 
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9.4 While statement 

The while statement has the form 

while ( expression ) statement 

The substatement is executed repeatedly so long as the value of the expression remains non-zero. The 
test takes place before each execution of the statement. 

9.5 Do statement 

The do statement has the form 

do statement while ( expression ); 

The substatement is executed repeatedly until the value of the expression becomes zero. The test 
takes place after each execution of the statement. 

9.6 For statement 

The for statement has the form 

for ( expression -; expression-2^^ ; expression-3 ^,) statement 

This statement is equivalent to 

expression-1; 
while {expression-2) { 
statement 
expression-3\ 

Thus the first expression specifies initialization for the loop; the second specifies a test, made before 
each iteration, such that the loop is exited when the expression becomes 0; the third expression typi¬ 
cally specifies an incrementation which is performed after each iteration. 

Any or all of tiie expressions may be dropped, A missing expression-2 makes the implied while 
clause equivalent to “while( 1)”; other missing expressions are simply dropped from the expansion 
above. 

9.7 Switch statement 

The switch statement causes control to be transferred to one of several statements depending on the 
value of an expression. It has the form 

switch ( expression ) statement 

The expression must be int or char. The statement is typically compound. Each statement within the 
statement may be labelled with case prefixes as follows: 

case constant-expression : 

where the constant expression must be int or char. No two of the case constants in a switch may have 
the same value. Constant expressions are precisely defined in §15. 

There may also be at most one statement prefix of the form 
default: 

When the switch statement is executed, its expression is evaluated and compared with each case con¬ 
stant in an undefined order. If one of the case constants is equal to the value of the expression, con¬ 
trol is passed to the siaement following the matched case prefix. If no case constant matches the ex¬ 
pression, and if there is a default prefix, control p^'sses to the prefixed statement. In the absence of a 
deiault prefix none of the statements in the switch is executed. 

Case or default prefixes in themselves do not alter the flow of control. 
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9.8 Break statement 
The statement 

break ; 

causes ternination of the smallest enclosing while, do, for, or switch statement; control passes to the 
statement following the terminated statement 

9.9 Continue statement 
The statement 

ccrrtinue ; 

causes control to pass to the loop-continuation portion of the smallest enclosing while, do, or for state¬ 
ment; that Ls to the end of the loop. More precisely, in each of the statements 

while (...){ do { for (...) { 

cortin:; contin:; contin:; 

} /while (...): } 

a continue Ls equivalent to “goto contin“. 

_9.I0 Return itatemeni 

A function returns to its caller by means of the return statement, which has one of the forms 
return ; 

return ( expression ); 

In the first r j r.^ value is returned. In the second case, the value of the expression is returned to the 
caller of the unction. If required, the expression is converted, as if by assignment, to the type of the 
function in which it appears. Flowing off the end of a function is equivalent to a return with no re¬ 
turned value- 

9.11 Goto s*.atemen: 

Control may be transferred unconditionally by means of the statement 
goto expression ; 

The expression should be a label (§§9.12, 14.4) or an expression of type “pointer to Int” which evalu¬ 
ates to a label. It is illegal to transftr to a label not located in the current function unless some extra¬ 
language provision has been made to adjust thi^ stack correctly. 

9.12 Labelled statement 

Any statement may be preceded by label prefixes of the form 
identifer : 


which serve :o declare the identifier as a label. More details on the semantics of labels are given in 
§14.4 below. 

9.13 Null sutement 
The null suiemenl has the form 


A null statement is useful to carT>' a label just before the of a compound statement or to supply a 
null body to a looping itaiement such as while. 
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10. External definitions 

A C program consists of a sequence of external definitions. External definitions may be given for 
functions, for simple variables, and for arrays. They are used both to declare and to reserve storage for 
objects. An external definition declares an identifier to have storage class extern and a specified type. 
The type-specifier (§3.2) may be empty, in which case the type is taken to be int 

10.1 External function definitions 
Function definitions have the form 

function-definition: 

type-specifier^ function-declarator function-body 

A function declarator/s similar to a declarator for a “function returning except that it lists the for¬ 
mal parameters of the function being defined. 

function-declarator: 

declarator ( parameter-list ^) 

parameter-list: 

identifier 

identifier , parameter-list 
The function-body has the form 
function-body: 

type-decl-list function-statement 

The purpose of the type-decl-iisi is to give the types of the formal parameters. No other identifiers 
should be declared in this list, and formal parameters should be declared only here. 

The function-statement is just a compound statement which may have declarations at the start 

function-statement: 

{ declaration-list^ statement-list } 

A simple example of a complete function definition is 

Int max (a. b, c) 
int a, b, c; 

{ 

int m; 

m =* {a> b)? a:b; 
return (m > c? m: c); 

1 


Here “int“ is the type-specifier; “max(a, b, c)“ is the function-declarator; “int a, b, c;“ is the type- 
decl-list for the formal parameters; “{ ... 1“ is the function-statement 

C converts all float actual parameters to double, so formal parameters declared float have their de¬ 
claration adjusted to read doubla Also, since a reference to an array in any context (in particular as an 
actual parameter) is taken to mean a pointer to the first element of the array, declarations of formal 
parameters declared “array of..." are adjusted to read “pointer to ...“. Finally, because neither struc¬ 
tures nor functions can be passed to a function, it is useless to declare a forma! parameter to be a 
structi-re or function (pointers to structures or functions are of course permired). 

A free return statement is supplied at the end of each function definif on, sc running off the end 
causes control, but no value, to be returned to the caller. 

10.2 External data denniiions 

An external data dennition has the form 
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data-definition: 

extern^ type-specifier^^ inihdeciaratordist ^^; 

The optional extern specifier is discussed in § 11.2. If given, the init-declarator-list is a comma- 
separated list of declarators each of which may be followed by an iniualizer for the declarator. 

init-declarator-Ust: 

init-declarator 

init-declarator , init-declaratordist 

inihdeclarator: 

declarator initializer^ 

Each initializer represents the initial value for the corresponding object being defined (and decla,*ed). 

initializer: 

constant 

1 constant-expression-lis :) 


constant-expression-list: 

consiant-expression 

constant-expression , constant-expression-list 

Thus an initializer consists of a constant-valued expression, or comma-separated list of expressions, in¬ 
side braces. The braces may be dropped vvhen the expression is just a plain constarit. The exact 
meaning of a constant expression is discussed in §15. The expression Ust is used to initialize arrays, 
see below. 

The type of the identifier being defined should be compatible with the type of the initializer: a dou¬ 
ble constant may initialize a float or double identifier; a non-floating-poini expression may initialize an 
int, char, or pointer. 

An initializer for an array may contain a comma-separated list of compile-time expressions. The 
length of the array is taken to be the maximum of the number of expressions in the list and the 
square-bracketed constant in the array’s declarator. This constant may be missing, in which case 1 is 
used. The expressions initialize successive members of the array stalling at the origin (subscript 0) of 
the array. The acceptable expressions for an array of type “array of are the same as those for type 
As a special case, a single string may be ^ven as the initializer for an array of chars; in this case, 
the characters in the string are taken as the initializing values. 

Structure^ can be initialized, but this operation is incompletely implemented and machine- 
dependent. Basically the structure is regarded as a sequence of words and the initializers are placed 
into those words. Structure initialization, using a comma-separated list in braces, is safe if all the 
members of the structure are integers or pointers but is otherwise ill-advised. 

The initial value of any externally-defined object not explicitly initialized is guaranteed to be 0. 

11. Scope rules 

A complete C program need not all be compiled at the same time: the source text of the program 
may be kept in several files, and precompiled routines may be loaded from libraries. Communication 
among the functions of a program may be carried out both through explicit calls and through manipu¬ 
lation of external data. 

Therefore, there are two kinds of scope to consider: first, what be called the lexical scope of an 
identifier, which is essentially the region of a program during which it may be used wit ut drawing 
“undefined identifier” diagnostics; and second, the scope associated wth external identifiers, which is 
characterized by the rule that references to the same external identifier are references to the same ob¬ 
ject. 


k 
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11.1 Lexical scope 

C is not a biock-structurer< language: this may fairly be ccnsidered a defect The lexical scope of 
names declared in exiernal definitions extends from Iheir definition tL-Dugh the end of the file in 
which they appear. The lexical scope of namei. declared at the head of functions (either as formal 
parameters or in the declarations heading the statements consiiiuling the function itselO is the body of 
the function. 

It is an error to redeclare identifiers already declared in the current context, unless the new declara¬ 
tion specifies the same type and storage class as already possessed by the identifiers. 

11.2 Scope of externals 

If a function declares an identifier lO be extern, then somewhere among the files or libraries consti¬ 
tuting the complete program there must be an external definiiion for ‘he identifier. All functions in a 
given program which refer to the same external identifier refer :o the sa^rie object, so care must be tak¬ 
en that the type and extent specified in the definition are compatible with these specified by each 
function which references the data. 

In PDP-11 C, it is explicitly permitted for (compatible) exienal definitions of the same identifier to 
be present in several of the separately-compiled pieces of a cenpiete program, or even twice within the 
same program file, whh the important limitation that the ider/ifier may be initialized in at most one of 
the definitions. In other operating syr.ems, however, the co—piler mur. know in just which file the 
storage for the identifier is allocated, and i \ which file the identifier is merely being referred to. In the 
implementations of C for such systems, the appearance of ‘Jie extern keyword before an external 
definition indicates that storage for the identifiers being declared will be -liocated in another file. Thus 
in a multi-file program, an external data definition without the extern specifier mest app^ in exactly 
one of the files. Any other files which wish to give an extv nal definitSon for the identifier must in¬ 
clude the extern in the definition. The identifier can be initiidzed only :n the file where storage is al¬ 
located. 

In PDP-11 C none of this nonsense is necessary and the extern specifier is ignored in external 
definitions. 

12. Compiler control lines 

When a line of a C program begins with the character #, it s interpreted not by the compiler itself, 
but by a preprocessor which is capable of replacing instances cf given identifiers with arbitrary token- 
string.s and of inserting named files into the source program. Ln order to cause ihs preprocessor to be 
invoked, it is necessary that the very first line of the program begin with #. Since null lines are ig¬ 
nored by the preprocessor, this line need contain no other information. 

12.1 Token replacement 

A compiler-comrol line of the form 

# define identifier icken-string 

(nrte: no trailing semicolon) causes the preprocessor to replace subsequent instances of the identifier 
with the given string of tokens (except within compiler coniral lines). The replacement token-string 
has comments removed from it, and it is surrounded with bisnks. No rescanning of the replacement 
string is attempted. This facility is most valuable for definition of “manifest constants”, as in 

# define tabsize 1CXD 

Int tabieltabsize]; 


12.2 File inclusion 

Large C programs often contain many external data definibens. Since :he lexica] scope of external 
definitions extends to the e^nd of the program file. It is good practice to pot all the external definitions 
for data at the start of tl- e program file, so that the functions defined wrJiin the file need not repeat 
tedious and error-prone declarations fer each external ideniiber they use. It is also useful to put a 
heavily used structure definition at the smrt and use its structure tag to declare the auto pointers to the 
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structure used within functions. To further exploit this technique when a large C program consists of 
several files, a compiler control line of the form 

# include "^filename"* 

results in the replacement of that line by the entire contents of the file filename. 


13. Implicit declarations . 

It is not always necessary to specify both the storage class and the type of identifiers in a declaration. 
Somiiimes the storaS c^^^^ by the context: in external definitions and in declarations of 

formal parameters and structure members. In a declaration inside a function. J} 'Si” the 

type i= given, the identifier is assumed to be int; if a type but no storage class is •ndicat^. Je 
id^tifier^is assumed to be auto. An exception to the latter rule is made for functions, since auto func¬ 
tions are meaningless (C being incapable of compiling code into the stack). If the type of an identifie 
is “function returning it is implicitly declared to be extern. 

In an expression, an identifier followed by ( and not currently declared is contextually declared to be 
“function returning int'\ 

Undefined identifiers not followed by ( are assumed to be la«is which will be ‘*1® 

function. (Since a label is not an lvalue, this accounts for tne Lvalue 

times noticed when an undeclared identifier is used.) Naturally, appearance of an idei.an^r a., a labe 
declares it as such. 

For some purposes it is best to consider formal parameters as belonging to their own storage class 
In practice, C treats parameters as if they were automatic (except that, as mentioned above, formal 
parameter arrays and floats are treated specially). 


14. Types revisited . r - 

This section summarizes the operations which can be performed on objects of certain types. 


14.1 Structures ^ ti. 

There are only two things that can be done with a structure; pick out one of its members (by means 
of the . or — > operators); or take its address (by unary &). Chher operations, such as as^ginng 
or to it or passing it as a parameter, draw an error message. In the future, tt is expected that these 
operations, but not necessarily others, will be allowed. 


14.2 Functions 

There are only two things that can be done with a function: call it, or take its address. If the naine 
of a function appears in an expression not in the function-name position of a .call, a pointer to the 
function is generated. Thus, to pass one function to another, one might say 

int f(); 


g(f): 

Then the definition of g might read 

g(funcp) 
int (•funcp) (): 

{ 

(•funcp) (): 


Notice that /was declared explicitly in the calling routine since its first appearance was not followed by 
(. 
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14.3 Arrays, pointers, and subscripting 

Every lime an identifier of array type appears in an expression, it is converted into a pointer to the 
first member of the array. Because of this conversion, arrays are not lvalues. By definition, the sub¬ 
script operator I] is interpreted in such a way that “E1[E2]” is identical to “♦((E1) + (E2))”. Because 
of the conversion rules which apply to -f, if El is an array and E2 an integer, then El[£2l refers to 
the E2-th member of El. Therefore, despite its asymmetric appearance, subscripting is a commutative 
operation. 

A consistent rule is followed in the case of multi-dimensicnal arrays. If E is an n-dimensional array 
of rank then E appearing in an expression is converted to a pointer to an 

(rt—l)-dimensional array with rankyx ...x/c. If the ♦ operator, either explicitly or implicitly as a result 
of subscripting, is applied to this pointer, the result is the pointed-to (/j— l)-dimensional array, which it¬ 
self is immediately converted into a pointer. 

For example, consider 
Int xi3lt5]; 

Here .x is a 3x5 array of integers. When x appears in an expression, it is converted to a pointer to (the 
first of three) 5-membered arrays of integers. In the expression '"x[i)**, which is equivalent to 
“•(x+i)”, X is first converted to a pointer as described; then / is convened to the type of x, which in¬ 
volves multiplying / by the length the object to which the po'-^er points, narnely 5 integer objects. 
The results are added and indirection applied to yield an array (of i mie^rs) which in turn is convert¬ 
ed to a pointe- to the first of the integers. If there is another subscript the same argument applies 
again; this time the result is an integer. ~ 

It follows from all this that arrays in C are itcred row-wise (last subscript varies fastest) and that the 
first subscript in the declaration helps determine the amount of storage consumed by an array but 
plays no other part in subscript calculations. 

14.4 Labels 

Labels do not have a type of their own; they are treated as having type “array of int”. Label vari¬ 
ables should be declared “pointer to int”; before execution of a goto referring to the variable, a label 
(or an expression deriving from a label) should be assigned to the variable. 

Label variables are a bad idea in general; the switch statement makes them almost always unneces¬ 
sary. 

15. Constant expressions 

In several places C requires expressions which evaluate to a constant: after case, as array bounds, 
and in initializers. In the first two cases, the expression can involve only integer constants, character 
constants, and sizeof expressions, possibly connected by the binary operators 

+ - * / % d I * << >> 

or by the unary operators 


Parentheses can be used for grouping, but not for func.ion calls. 

A bit more latitude is permitted for initializers; besides constant expressions as discussed above, one 
can also apply the unary & operator to external scalars, and to external arrays subscripted with a con¬ 
stant expression. The unary & can also be applied implicitly by appearance of unsubscripied external 
arrays. The rule here is that initializers must evaluate either to a consent or to the address of an 
external identifier plus or minus a constant. 
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16. Examples. „ • ui . i- 

These examples are intended to illustrate some typical C constructions as well as a serviceable style 

of writing C programs. 

16.1 Inner product 

This function returns the inner product of its array arguments. 

double inner (v1, v2, n) 
double v1( 1, v2[}; 

{ 

double sum; 
int i; 

sum — 0.0; 

for (i•=0; i<n; i++) 

sum =+ v1(i] • v2[i!; 
return (sum): 

1 

The following version is somewhat more efficient, but perhaps a little less clear. It uses the facts that 
parameter arrays are really pointers, and that all parameters are passed by value. 

double inner (v1, v2, n) 
double ‘Vi, *72; 

( 

double sum; 
sum — 0.0; 
while (n—) 

sum =+•v1++ • •v2++; 
return (sum); 

} 

The declarations for the parameters are really exactly the same as in the last example. In the first case 
array declarations “ [ 1” were given to emphasize that the parameters would be referred to as arrays; in 
the second, pointer declarations were given because the indirection operator and ++ were used. 


16.2 Tree and character processing 

Here is a complete C program (courtesy of R. Haight) which reads a document and produces an 
phabetized list of words found therein ‘ogether with the number of occurrences of each word. The 
method keeps a binary tree of words such that the left descendant tree for each word has all the words 
lexicographically smaller than the given word, and the right descendant has all the larger words. Both 
the insertion and the printing routine are recursive. 

The program calls the library routines getchar to pick up characters and ex/r to terminate execution. 
Print/ is called to print the results according to a format string. A version of print/ is given below 
(§16.3). 

Because all the external definitions for data are given at the top, no extern declarations are necessary 

within the functions. To stay within the rules, a type declaration is given for each non-integer func¬ 

tion when the function is used before it is defined. However, since all such functions return pointers 
which are simply assigned to other pointers, no actual harm would result from leaving out the declara¬ 
tions; the supposedly int function values would be assigned without error or complaint. 

* define nwords 100 /* number of different words •/ 

* define wsize 20 /* rnax chars per word •/ 

struct tncde { /* the basic structure •/ 

char tword[ wsize 1: 
int count: 
struct tnode ‘left; 
struct tnode ’right; 

I 
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Struct tnode space[nwordsI: 
int modes nwords: 
struct tnode *spacep space; 
struct tnode *freep; 


/• the words themselves •/ 

/• number of remaining slots •/ 
/• next available slot •/ 

/• free list •/ 


* The main routine reads words until end-^f-file (ACT returned from ’getchar**) 

* "tree* is called to sort each word into the tree. 


•/ 

main{) 

( 

struct tnode ’top, *tree(); 
char c. wordlwsize]: 

'nt i: 


i = top = 0; 
vhile (c=getchar()) 

if (■a'<='C c<='z :i 'A'< = : && c <='Z') { 

if (i<wsize—1) 

word[i+-r] = c: 

) else 

if (!) ( 

word[i-r+l = '\0'; 

_ top “ tree (top, word); 
i = 0: 

1 

(print (top): 

) 

A 

* The central routine. If the subtree pointer is null, allocate a new node for it. 

* If the new word and the node's word are the same, increase the node's count. 

* Otherwise, recursively sort the word into the left or right subtree according 

* as the argument word is less or greater than the node's word. 

•/ 

struct tnode ‘treefp, word) 
struct tnode *p; 
char word (]; 

( 

struct tnode •alloc (): 
int cond; 

/• Is pointer null? */ 
if (p==0) 1 

p = alloc (); 
copy (word, p->tword); 
p—>count = 1; 
p—>right « p—>left = 0; 
return (p); 

) 

/* Is word repeated? •/ 

if ( (cond=*compar(p~>tword. word)) “ 0) { 

p—>00001++: 

return (p); 

i 

/• Sort into left or right •/ 
if (cond<C) 

p->left = tree(p —>left, word); 

else 

p —>right = tree(p — >right, wore); 

■ 4 ’ 
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re.jm(p); 

) 

/• 

* Print the tree by printing the left subtree, the given node, and the right subtree 

•/ 

tprlnt(p) 
struct tnode *p: 

I 

while (p) I 

tprint(p->left); 

print!r%d: %s\n". p->count. p->tword); 
p * p->right; 


- String ccmpariP'^n: return number (>,=,<) 0 
‘ ucccrding ass. (>.*=. <^ 

•/ 

compar(s1. s2) 
char *s1. *82: 

i 

\ 

int c1. c2: 

whi!e({c1 ** •s1++) == (c2 =* *82++)) 
if (c1=-^0') 

return (0); 
retum(c2—cl); 

} 

/• 

• String copy: copy s1 into s2 until the null 

• character appears. 

copy{s1. s2) 
char *81. *82; 

( 

while (*82++ = ’sl+H"); 

I 

/• 

• Node allocation: return pointer to a free node. 

• Bomb out when all are gone. Just for fun. there 

• is a mechanism for using nodes that have been 

• freed, even though no one here calls "free.* 

•/ 

struct tnode ‘alloc () 

I 

struct tnode ‘t; 

if (freep) { 

t » freep; 

freep = freep—> left; 
return (t); 

if (—modes < 0) I 

printf ("Out of space\n"); 
exit (): 

} 

return (spacep++); 

} 

/• 
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• Th 3 uncalled routine which puts a node on the free list. 


free (p) 

struct tnode ‘p; 

{ 

p—>left “ freep: 
freep - p: 


To illustrate a slightly different technique of handling the same problem, w will repeat fraginents of 
this example with the tree nodes treated explicitly as members of m array. The fundamentalchan^ is 
to deal with the subscript of the array member under discussion, instead of a pointer to sL The struct 
declaration becomes 


struct tnode { 

char tword [ wsize 1; 
int count: 
int left; 
int right; 

): 


and alloc becomes 

aiiocO 
( • - 

int t; 

t = —nnodes; 
if(t<-0)( 

printf ("Out of space\n’); 
exit(); 

1 

return(t); 

) 

The free stuff has disappeared because if we deal with exclusively with subscripts some sort of map has 
to be kept, which is too much trouble. 

Now the tree routine returns a subscript also, and it becomes: 

tree (p, word) 
char word (1: 

{ 

int cond; 

if (p==0) { 

p = alloc () ; 

copy (word, space I pi.tword): 
space I pi.count =■ 1; 
space I pi.right = space [pi.left = 0: 
return (p): 

if ((cond=compar(space[pl.tword, word)) ==0) I 
space [ p 1 .count++; 
return (p); 

} 

if (cond<0) 

soacelpl.left = tree (space [pi.left, word); 

else , , 

space [pi.right = tree (space [pi.right, word): 

return{p); 


1 
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The other routines are changed similarly. It must be pointed out that this version is noticeably le^ 
efficient than the first because of the multiplications which must be done to compute an offset in space 

corresponding to the subscripts. . . • /i i 

The ohiervation that subscripts (like “atil”) are less efficient than pointer indhection (like 
“•ap”) holds true independently of whether or not structures are involved. There are of course many 
situations where subacripts are indispensable, and others where the loss in e iciency is wor . a g 
clarity. 


16.3 Formatted output . - .-v i. , 

Here is a simplified version of the />rmr/routine, which is available in the ^“cep« a 
string (character array) as first argument, and prints subsequent arguments 
contained in this format string. Most characters in the string are S'^Ply ^ 

character sequences beginning with “%” specify that the next argument should be pnnUd in a style as 

follows: 

%d decimal number 

%o c-ctal number , - n 

%c ASCII character, or 2 characters if upper character is not null 
%s string (null*terminated array of characters) 

%f floating-point number 

The actual parameters for each function call are laid out contiguously in increasing storage j^^tions; 
therefore, a function with a variable number of arguments may take the address of (say) its first argu¬ 
ment, and access the remaining arguments by use of sub^ripting (regarding the arguments as an ar¬ 
ray ) or by indirection combined with pointer incrementation. 

If in such a situation the arguments have mixed types, or if in general one wuhes to insist that an 
lvalue should be treated as having a given type, then struct declarations like those dlustrate e ow 
will be useful. It should be evident, though, that such techniques are implementation dependent. 

/V/zir/depends as well on the fact that char and float arguments are widened respectively to Int and 
double, so there are effectively only two sizes of arguments to deal with. Prmtf calls the library 
routines putchar to write out single characters and ftoa to dispose of floating-point numbers. 

printf(fmt, args) 
char fmt [ ]: 


char *s : 

struct I char »*charpp:): 
struct { double *doublep;}; 
int ‘ap. X, c : 

ap = iargs; /• argument pointer •/ 

for (;:) { , 

while( (c = ‘(mt-l-f) != % ) { 

return; 
putchar(c); 

switch (c = •fmt-l—h) { 

/• decimal •/ 
case'd': 

X = •ap-h-t-; 
if(x < 0) { 

X = -x; 

if(x<0) ( /‘is - infinity »/ 
printfC-32768’): 
continue; 




ptjlchar(’— 


) 

printd(x); 
continue; 

/• octal •/ 
case 'o': 

printo(*ap++): 
continue; 

/• float, double •/ 

'f': 

/• let ftoa do the real work •/ 
ftoa (•ap.doublep++); 
continue: 

/* ot aracter •/ 
case 'c': 

putchar(*ap-!-+); 
continue; 

/• string •/ 
case 's': 

s •ap.charpp++; 
while (c •S++) 
putchar(c); 
continue; 

} 

putchar(c); 


/• 

• Print n in decimal; n must be non-negative 


•/ 

printd(n) 

int a; 

If (a-n/10) 

printd(a); 

putchar(n%10 + '0'): 

} 

/• 

• Print n in octal, with exactly 1 leading 0 

•/ 

printo(n) 

* If (n) 

printo((n>>3)&017777); 

putchar({n&07)-H'0'): 


MH-1273-DMR- 
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1. Expressions. 
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APPENDIX 1 
Syntax Summary 


expression: 

primary 

* expression 
& expression 

— expression 
! expression 

• expression 
++ lvalue 

— lvalue 
lvalue 4-+ 
lvalue — 

size of expression 
ex,, ’ession binop expression 
exprt:SSion ? expression : expression 
lvalue asgnop expression 
expression , expression 

primary: 


identifier 

constant 

string 

{ expression) 

primary ( expression-list ^) 
primary [ expression ] 
lvalue . identifier 
primary — > identifier 

lvalue: 

identifier 

primary [ expression ] 
lvalue, identifier 
primary — > identifier 

♦ expression 
( lvalue ) 

The primary-expression operators 

{) [] . -> 

have highest priority and group left-to-right. The unary operators 
& - ! - -I-4- — sizeof 

have priority below the primary operators but higher than any binary operator, and group nghi- 
to-Ieft. Binary operators and the conditional operator all group left-to-right, and have priority 
decreasing as indicated: 

binop: 

* / % 

+ - 
>> << 

< > > = 


> = 
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& 

I 

&& 

II 

? ; 

Assignment operators all have the same priority, and all group right-to-left. 
asgnop. ^ 

The comma operator has the lowest priority, and groups left-to-right 
2. Declarations. 

declaration: 

decl-specifiers declarator-list^ ; 

decl-specifien: 

type-specifier 

sc-szecifier 

type-specifier sc-specifisr 
sc-^cifier type-specifier 

sc-specifier: 

auto 

static — 

extern 

register 

type-specifier 

int 

char 

float 

doi^ie 

struct { type-decl-list ) 
struct identifier { type-decNist 1 
Sta^t identifier 

declarator-lisv 

declarator 

declarator , declarator-list 

declarator: 

idendfier 
♦ declarator 
declarator () 

declarator [ consumt-expression^ ] 

{ declarator) 

rype-decl-lisv 

type-declaration 
type-declaration type-decl-list 

type-declaraticn: 

type-specifier declarator-list ; 


3. Statements. 

staiement: 

expression \ 
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{ statemenhlist ) 

if ( expression ) statement 

if ( expression) statement else statement 

while ( expression ) statement 

for ( expression^ ; expression ^; expression^ ) statement 

switch { expression ) statement 

case constant-expression: statement 

default : statement 

break ; 

continue ; 

return ; 

return ( expression ); 
goto expression ; 
identifier : statement 


statement-list: 

statement 

statement statement-list 


4. External definitions. 
program: 

external-definition 
external-definition program— 

external-definition: 

function-definition 

data-definition 

function-definition: 

type-specifier^ function-declarator function-body 

function-declarator: 

declarator { parameter-list^) 

parameter-list: 

identifier 

identifier , parameter-list 
function-body: 

type-decl-list f unci ion-statement 

function-statement: 

{ declaration-list statement-list } 

' Cpf 

data-definition: 

extern^ type-specifier^ init-declarator-Ust ^; 

init-declorator-1ist: 

init-declarator 

init-declorator , init-declarator-list 

init-declarator: 

declarator initializer^ 
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iritializer: 

constant , 

( constant^expre'jsion-ltst j 


constant-expression’list: 

constan t^-expression 

constant-expression 


constant-expression-list 


constant-e^vression: 

expression 

5. Preprocessor 

# define identifier token-string 

# include ’‘filename" 






